2D1431 Machine Learning Lab 4: Reinforcement Learning

نویسندگان

  • Frank Hoffmann
  • Örjan Ekeberg
چکیده

In this lab you will learn about dynamic programming and reinforcement learning. It is assumed that you are familiar with the basic concepts of reinforcement learning and that you have read chapter 13 in the course book Machine Learning (Mitchell, 1997). The first four chapters of the survey on reinforcement learning by Kaelbling et al. (1996) is a good supplementary material. For further reading and a detailed discussion of policy iteration and reinforcement learning, the textbook “Reinforcement Learning” is highly recommendable (Sutton and Barto, 1999). In particular studying chapters 3,4 and 6 is of immense help for this lab. The predefined Matlab functions for this lab are located in the course directory /info/mi03/labs/lab4. Dynamic programming refers to a class of algorithms that can be used to compute optimal policies given a complete model of the environment. Dynamic programming solves problems that can be formulated as Markov decision processes. Unlike in the reinforcement learning case, dynamic programming assumes that the state transition and reward functions are known. The central idea of dynamic programming and reinforcement learning is to learn value functions, which in turn can be used to identify the optimal policy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

2D1431 Machine Learning Lab 3: Reinforcement Learning

In this lab you will learn about dynamic programming and reinforcement learning. It is assumed that you are familiar with the basic concepts of reinforcement learning and that you have read chapter 13 in the course bookMachine Learning (Mitchell, 1997). The first four chapters of the survey on reinforcement learning by Kaelbling et al. (1996) is a good supplementary material. For further readin...

متن کامل

2d1431 Machine Learning Lab 3: Instance Based Learning & Neural Networks

In this lab you will learn about instance based learning algorithm (locally weighted regression) and artificial neural networks and apply both techniques to function approximation. You will also learn how to use crossvalidation for parameter and feature selection. You will have to implement the code for locally weighted regression and cross-validation. You will use some existing code for the ba...

متن کامل

2D1431 Machine Learning Lab 2: Bayes Classifier & Boosting

In this lab you will implement a Bayes Classifier and the Adaboost algorithm that improves the performance of a weak classifier by aggregating multiple hypotheses generated across different distributions of the training data. Some predefined functions for visualization and basic operations are provided, but you will have to program the key algorithms yourself. During the examination with the la...

متن کامل

2D1431 Machine Learning Lab 1: Concept Learning & Decision Trees

You have to prepare the solutions to the lab assignments prior to the scheduled labs, which are mainly for examination. In order to pass the lab you present your program and answers to the question to the assistent. Labs can be presented in groups of two, however both students need to fully understand the entire solution and answers. It is also assumed that you complete the assignment on your o...

متن کامل

Incremental Machine Learning to Reduce Biochemistry Lab Costs in the Search for Drug Discovery

This paper promotes the use of supervised machine learning in laboratory settings where chemists have a large number of samples to test for some property, and are interested in identifying as many positive instances for the least laboratory testing effort. Rather than traditional supervised learning where the chemists would first develop a large training set and then train a classifier, the pap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003